
Conversation

peerschuett
Contributor

@peerschuett peerschuett commented Jul 24, 2025

vLLM doesn't work at the moment because of the MaxTokens parameter: it is currently set to -1, but vLLM requires it to be > 0.
As an initial fix, we removed the MaxTokens parameter and now need to test whether this is the best solution.

[Screenshot attached]
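For illustration, here is a minimal sketch of the behavior described above against vLLM's OpenAI-compatible endpoint, using the `openai` Python client. The endpoint URL, API key, and model name are placeholder assumptions, and this is not AI Studio's actual request code; it only shows that `max_tokens=-1` is rejected while omitting the parameter works.

```python
# Minimal sketch (not AI Studio code): vLLM's OpenAI-compatible server
# requires max_tokens > 0, so the safe option is to omit the parameter.
from openai import OpenAI

# Assumed local vLLM endpoint; the API key value is a placeholder.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")

messages = [{"role": "user", "content": "Say hello."}]

# Rejected: a max_tokens of -1 fails vLLM's validation (it must be > 0).
# client.chat.completions.create(model="my-model", messages=messages, max_tokens=-1)

# Works: omit max_tokens and let the server fall back to its default.
# LM Studio and Ollama accept requests without max_tokens as well.
response = client.chat.completions.create(model="my-model", messages=messages)
print(response.choices[0].message.content)
```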

@peerschuett peerschuett self-assigned this Jul 24, 2025
@peerschuett
Contributor Author

I tested LM Studio, Ollama, and vLLM with this setting, and they all generate responses.

@SommerEngineering SommerEngineering requested a review from a team as a code owner August 10, 2025 14:21
@SommerEngineering SommerEngineering merged commit b75d90b into MindWorkAI:main Aug 10, 2025
@peerschuett peerschuett deleted the vLLM branch August 11, 2025 07:55